Ensemble Methods for Noise Elimination in Classification Problems
نویسندگان
چکیده
Ensemble methods combine a set of classifiers to construct a new classifier that is (often) more accurate than any of its component classifiers. In this paper, we use ensemble methods to identify noisy training examples. More precisely, we consider the problem of mislabeled training examples in classification tasks, and address this problem by pre-processing the training set, i.e. by identifying and removing outliers from the training set. We study a number of filter techniques that are based on well-known ensemble methods like cross-validated committees, bagging and boosting. We evaluate these techniques in an Inductive Logic Programming setting and use a first order decision tree algorithm to construct the ensembles.
منابع مشابه
ADABOOST ENSEMBLE ALGORITHMS FOR BREAST CANCER CLASSIFICATION
With an advance in technologies, different tumor features have been collected for Breast Cancer (BC) diagnosis, processing of dealing with large data set suffers some challenges which include high storage capacity and time require for accessing and processing. The objective of this paper is to classify BC based on the extracted tumor features. To extract useful information and diagnose the tumo...
متن کاملOptimum Ensemble Classification for Fully Polarimetric SAR Data Using Global-Local Classification Approach
In this paper, a proposed ensemble classification for fully polarimetric synthetic aperture radar (PolSAR) data using a global-local classification approach is presented. In the first step, to perform the global classification, the training feature space is divided into a specified number of clusters. In the next step to carry out the local classification over each of these clusters, which cont...
متن کاملFault Detection of Anti-friction Bearing using Ensemble Machine Learning Methods
Anti-Friction Bearing (AFB) is a very important machine component and its unscheduled failure leads to cause of malfunction in wide range of rotating machinery which results in unexpected downtime and economic loss. In this paper, ensemble machine learning techniques are demonstrated for the detection of different AFB faults. Initially, statistical features were extracted from temporal vibratio...
متن کاملA Unique Approach of Noise Elimination from Electroencephalography Signals between Normal and Meditation State
In this paper, unique approach is presented for the electroencephalography (EEG) signals analysis. This is based on Eigen values distribution of a matrix which is called as scaled Hankel matrix. This gives us a way to find out the number of Eigen values essential for noise reduction and extraction of signal in singular spectrum analysis. This paper gives us an approach to classify the EEG signa...
متن کاملValidation of Synoptic Station Data Using Ensemble Classification on Central Iran
Today, the use of data recorded in synoptic stations of the country is one of the most significant sources of applied research for researchers. Data recorded automatically or manually at synoptic, climatological, and other stations are analyzed for statistical analysis. In this research, the data recorded in the synoptic stations of Iran, which are used to determine the days of dust, were analy...
متن کامل